Back

Psychological Review

American Psychological Association (APA)

Preprints posted in the last 90 days, ranked by how well they match Psychological Review's content profile, based on 19 papers previously published here. The average preprint has a 0.01% match score for this journal, so anything above that is already an above-average fit.

1
Contrasting Probabilistic and Intentional Accounts of Confidence in Perceptual Decisions

Zylberberg, A.

2026-03-30 animal behavior and cognition 10.64898/2026.03.24.714055 medRxiv
Top 0.1%
17.1%
Show abstract

The ability to evaluate ones own knowledge states is often studied using paradigms in which participants make a decision and subsequently report their confidence. This structure has motivated hierarchical models in which confidence arises from a metacognitive process, distinct from the decision process itself, that estimates the probability that the choice is correct (Meyniel et al., 2015; Pouget et al., 2016; Fleming and Daw, 2017). Here, we contrast this framework with an alternative based on an intentional architecture (Shadlen et al., 2008). In this account, choice and confidence are determined simultaneously through a multidimensional drift-diffusion process, where each dimension represents one choice-confidence combination (Ratcliff and Starns, 2009, 2013). Choice, response time, and confidence jointly emerge when one of these accumulators reaches a decision bound. To adjudicate between these accounts, we fit both models to behavioral data from two perceptual tasks: a random-dots motion discrimination task with incentivized confidence reports, and a luminance discrimination task without feedback or incentives. The integrated model provided a superior fit for the incentivized motion task, whereas the hierarchical model more accurately captured behavior in the un-incentivized luminance task. These results suggest that confidence does not rely on a single computational mechanism, but rather its implementation may adapt to the specific demands and structure of the task.

2
The Metacognitive Sensitivity of Verbal and Numerical Confidence Reports

Zylberberg, A.; Alvarez Heduan, F.

2026-05-18 animal behavior and cognition 10.64898/2026.05.13.724887 medRxiv
Top 0.1%
4.7%
Show abstract

We study how confidence in perceptual decisions depends on whether it is communicated verbally (e.g., "very likely") or numerically (e.g., "80% certainty"). We find that verbal expressions more reliably distinguish correct from incorrect choices than numerical reports, challenging the common assumption that numerical probabilities provide more precise representations of uncertainty. Additionally, in a dyadic decision-making task in which participants can revise their initial reports based on a partners choice and expressed confidence, verbal and numerical reports are equally effective in supporting accurate revisions of initial judgments. Together, these results underscore the effectiveness of verbal expressions as a means of conveying decision confidence.

3
A computational account of how positive performance bias supports cognitive effort

Mori, K.; Yamada, M.

2026-05-18 neuroscience 10.64898/2026.05.13.725021 medRxiv
Top 0.1%
3.9%
Show abstract

The willingness to exert cognitive effort is essential but is constrained by the subjective cost of effort. Although effortful tasks are often avoided, positive bias about ones own performance may help sustain engagement with cognitive demands. Here, participants completed an effort-based decision-making task and reported trial-by-trial predictions of their own performance, allowing us to quantify performance prediction error (PPE) as the discrepancy between subjective and objective accuracy. The results showed that PPE was predominantly positive and increased with effort level, indicating greater overestimation under higher cognitive demands. Using a computational model, we show that choices were best explained by a learning model in which rewarded trials accompanied by positive PPE decreased subsequent sensitivity to effort. A confidence-based control model did not provide a better account of choices, suggesting that this effect was better captured by positive performance bias than by confidence alone. Our findings provide a computational account of how biased self-evaluation may attenuate the subjective cost of cognitive effort and extend the positive bias literature to the task need for cognitive effort.

4
Explaining temporally clustered errors with an autocorrelated Drift Diffusion Model

Vloeberghs, R.; Tuerlinckx, F.; Urai, A. E.; Desender, K.

2026-03-23 neuroscience 10.64898/2026.03.20.713186 medRxiv
Top 0.1%
3.1%
Show abstract

A widely used framework for studying the computational mechanisms of decision making is the Drift Diffusion Model (DDM). To account for the presence of both fast and slow errors in empirical data, the DDM incorporates across-trial variability in parameters such as the drift rate and the starting point. Although these variability parameters enable the model to reproduce both fast and slow errors, they rely on the assumption that over trials each parameter is independently sampled. As a result, the DDM effectively predicts that errors-- whether fast or slow--occur randomly over time. However, in empirical data this assumption is violated, as error responses are often temporally clustered. To address this limitation, we introduce the autocorrelated DDM, in which trial-to-trial fluctuations in drift rate, starting point, and boundary evolve according to first-order autoregressive (AR1) processes. Using simulations, we demonstrate that, unlike the across-trial variability DDM, the autocorrelated DDM naturally accounts for temporal clustering of errors. We further show that model parameters can be reliably recovered using Amortized Bayesian Inference, even with as few as 500 trials. Finally, fits to empirical data indicate that the autocorrelated DDM provides the best account of error clustering, highlighting that computational parameters fluctuate over time, despite typically being estimated as fixed across trials.

5
Confirmation Bias Exists in the Face of False Information

Razi, H.; Sambrook, T.; Garrett, N.

2026-05-11 neuroscience 10.64898/2026.05.07.723487 medRxiv
Top 0.1%
3.1%
Show abstract

Confirmation bias impacts judgments and decisions across a range of domains including finance, policy and science. Here we examine whether explicitly labelling information as true or false disrupts a core underlying computational mechanism that can generate this pervasive bias - asymmetric learning. Human participants (Study 1: N=47; Study 2: N=57) completed a 2 alternative forced choice (2AFC) task previously used to test for the presence of confirmation bias. Participants made choices between pairs of options that could win or lose money and received either factual or counterfactual feedback after each choice. We introduced a key novel feature into the task - providing explicit cues that signalled to participants whether feedback they had seen was true (verified) or false (debunked). Learning in response to feedback was attenuated under false compared to true labels but was present under both. Fitting participants choices to computational models enabled us to examine how sensitivity to the feedback varied as a function of both the label (true/false) and confirmation (confirmatory/disconfirmatory). This revealed a distinct pattern of learning rates typical of confirmation bias (enhanced learning from positive prediction errors for chosen options and from negative prediction errors for unchosen options) in response to both true and false labels. The findings highlight how confirmation bias plays an important role in the effectiveness of interventions designed to verify true and/or debunk false claims. Verification is less likely to succeed when information disconfirms prior beliefs. Conversely, debunking false claims is unlikely to succeed when the information confirms ones prior beliefs.

6
Human decision-makers terminate evidence accumulation using flexible decision rules

Kalburge, I.; Dallstream, A.; Josic, K.; Kilpatrick, Z. P.; Ding, L.; Gold, J. I.

2026-03-20 neuroscience 10.64898/2026.03.18.712662 medRxiv
Top 0.1%
3.0%
Show abstract

Decisions based on evidence accumulated over time require rules governing when to end the accumulation process and commit to a choice. These rules control inherent trade-offs between decision speed and accuracy, which require careful balance to maximize quantities that depend on both like reward rate. We previously showed that, to maximize reward rate, normative decision rules adapt to changing task conditions (Barendregt et al., 2022). Here we used a novel task to examine whether and how people use adaptive rules for individual decisions under a variety of conditions, including changes in decision outcomes across trials and changes in evidence quality both across and within trials. We found that the participants tended to use rules that adjusted, at least partially, to predictable changes in task conditions to improve reward rate, consistent with a rationally bounded implementation of normative principles. These findings help inform our understanding of the extent and limits of flexible decision formation in the brain.

7
The first step is not always the hardest: A change-point analysis of predictive learning

Diekmann, N.; Lissek, S.; Uengoer, M.; Cheng, S.

2026-03-19 neuroscience 10.64898/2026.03.17.712476 medRxiv
Top 0.1%
2.1%
Show abstract

The progress of learning is usually quantified by averaging responses across participants and/or multiple trials within a block. However, such approaches obscure the trial-by-trial progress of learning, which has been shown recently to express a rich variety of dynamics. An alternative approach which does not suffer from this problem is the detection and analysis of points of behavioral change, i.e., change-point analysis. Using change-point analysis, we reanalyzed data from human participants in different predictive learning tasks in which learned contingencies underwent reversal. We find that responses of individual participants were more accurately characterized by behavioral change points than the average learning curve. Importantly, change points significantly shifted to later trials during reversal learning indicating that reversal learning is more difficult than the initial learning. In a computational model based on deep reinforcement learning, we show that the change point shift required the replay of previous experiences, which in turn depends on the hippocampus. This finding is consistent with studies showing that lesions of the hippocampus yield faster reversal learning. In summary, we reaffirm the importance of the analysis of single participant responses, show that phenomenological learning rates are slower during reversal learning, and provide a theoretical account for this difference.

8
The resource-rational dynamics of evidence accumulation

Fang, M.; Mao, J.; Donner, T. H.; Stocker, A. A.

2026-04-20 animal behavior and cognition 10.64898/2026.04.15.718716 medRxiv
Top 0.1%
1.9%
Show abstract

Evidence accumulation is a fundamental aspect of human decision-making. However, how the precise temporal structure of evidence shapes the accumulation process has not been systematically studied. As a result, current understanding of evidence accumulation remains largely limited to its time-averaged behavior. We tested human subjects in a visual estimation task in which they inferred the angular position of an unknown source from a noisy stimulus sequence. Introducing systematic temporal perturbations, i.e., breaks of different durations and at different positions in the otherwise regular evidence sequence, revealed that subjects actively compensated for the memory loss endured during the break by dynamically enhancing evidence integration and memory maintenance immediately after the break. We derived a new time-continuous Bayesian updating model that is dynamically constrained by optimal performance-effort trade-offs. With two free parameters determining the overall resource-efficiencies of encoding and memory maintenance, the model accurately predicts the rich dependencies of subjects accumulation behavior on the evidence schedule, including subjects individual tendencies to emphasize either early (primacy) or late (recency) samples in the evidence sequence. Our results suggest that evidence accumulation is a non-stationary, dynamically controlled process that optimally balances the information gained from incoming evidence against the cognitive effort required to acquire and maintain it. The proposed model is general and should apply broadly across many task domains.

9
Towards the definition and measurement of routines and the cognitive processes that underpin their maintenance

Nolan, C. R.; Le Pelley, M. E.; Garner, K. G.

2026-03-28 neuroscience 10.64898/2026.03.26.714585 medRxiv
Top 0.1%
1.8%
Show abstract

The benefits of routines for daily functioning are widely acknowledged, yet, despite their apparent importance, methods for quantifying routine maintenance and the causes of their disruption remain lacking. Here, we propose a novel means of defining and quantifying routines (transition entropy). Using the transition entropy, we show that routines can be robustly elicited on tasks that require searching through a grid of squares for a hidden target. Over two experiments (N=100 each), we show that use of routines--as quantified by transition entropy--is robustly perturbed by frequent switches between search grids, as locations specific to the currently irrelevant grid become competitive for selection. Using a normative model that tracks task dynamics, we show that disruption to routines can be attributed to reduced sensitivity to the odds of success for completing a task. This suggests that routine maintenance may be disrupted by over-sensitivity to a lack of reward early in routine performance, or increased expectations regarding the utility of pursuing other tasks.

10
Memory reactivation during sleep promotes structure abstraction

Solomon, S. H.; Krishnamurthy, S.; Siefert, E. M.; Gonciulea, C. M.; Schapiro, A. C.

2026-04-11 neuroscience 10.64898/2026.04.10.717748 medRxiv
Top 0.1%
1.7%
Show abstract

We readily detect structure in our environments, which in turn guides future learning. Disentangling this structure from the superficial features of a specific learning environment provides an especially strong basis for future generalization, but it remains unclear when and how this kind of abstraction occurs. Memory reactivation during sleep has been hypothesized to support such abstraction, but this has yet to be directly tested. Here we examined this hypothesis by teaching participants novel categories in which patterns of feature covariation were governed by different graph structures. Participants then learned a new category, defined by entirely different features, whose structure was either congruent or incongruent with a previously learned category. If structural knowledge is abstracted away from superficial features, it should facilitate transfer when structures are congruent. In Experiment 1, when two categories were learned in immediate succession, participants showed no transfer benefit, suggesting that structure understanding remained tied to the original features. In Experiment 2, we tested whether offline processing promotes abstraction. Participants either remained awake between learning phases spaced 3 hours apart, or took a nap during which a previously learned category was reactivated using targeted memory reactivation (TMR). Transfer benefits emerged only when the reactivated and target categories shared the same structure, and these benefits increased with the number of cues presented during slow-wave sleep. These findings provide the first direct evidence that memory reactivation during sleep promotes the abstraction of structure, enabling knowledge to transfer across learning episodes with no overlap in features.

11
Space-based and object-based saccadic selection in visual working memory

Shurygina, O.; Wirth, L. A.; Rolfs, M.; Ohl, S.

2026-05-10 neuroscience 10.64898/2026.05.05.723053 medRxiv
Top 0.1%
1.7%
Show abstract

Saccades made during memory maintenance prioritize memory for the saccade target, but it is unclear if this benefit is specific to a location or extends across memorized objects. In three experiments, we examined whether saccadic selection spreads to other locations within the same object. In Experiment 1, we asked observers to remember three oriented Gabors presented either within contour-defined objects or without object structure. A subsequent movement cue prompted observers to move their eyes to the indicated location. We then probed memory for stimuli at locations equidistant from the saccade target, in either the same or a different object. Memory was best for stimuli at locations congruent with the saccade target, and consistently weaker for other stimuli presented in the same or a different object than the saccade target. In Experiment 2, we created more complex objects by adding more object features to the stimulus. Again, memory performance was best for stimuli congruent with the saccade target location, whereas memory in incongruent trials was worse and similar for stimuli in the same and different object as the saccade target. In Experiment 3, we tested if saccadic selection is present and propagates within the object in a change detection task. Again, memory performance (i.e., change detection) was best at the saccade target location. However, this memory benefit also spread to other locations within the same object. Our results imply that saccadic selection in visual working memory is primarily space-based but can also spread towards locations within the object where a saccade was directed.

12
Determinants of persistence in sequential effort-based decision-making

Chaigneau, A.; Moretti, R.; Iodice, P.; Pessiglione, M.; Pezzulo, G.

2026-05-14 neuroscience 10.64898/2026.05.11.723817 medRxiv
Top 0.1%
1.7%
Show abstract

Goal-directed behavior often requires sustained effort across a sequence of interdependent decisions, yet the determinants of persistence in such contexts remain poorly understood. Here, we investigated how individuals regulate persistence in a novel sequential effort-based task in which they controlled an avatar through successive checkpoints to reach a final goal and could make repeated attempts following failure. At each attempt, participants could choose either to persist in the same task or to disengage toward an easier but less rewarding alternative. We found that decisions to persist or disengage were jointly shaped by multiple interacting factors. Disengagement increased with task difficulty and lower skill level. It also increased with repeated attempts and time-on-task, indexing fatigue, and with accumulated errors, indexing lack of progress. Conversely, proximity to the goal promoted persistence and shaped decision dynamics by reducing choice conflict during persistence decisions and increasing hesitation during disengagement near the goal. Notably, clearing the first checkpoint produced a sharp increase in persistence, suggesting that early success plays a pivotal role. Furthermore, persistence reflected both retrospective and prospective evaluations of effort, with prior investment promoting commitment and anticipated effort reducing it. Finally, disengagement was preceded by short-term performance decline but not by gradual increases in decision conflict, suggesting relatively abrupt strategy shifts following repeated failures. Together, these findings provide a comprehensive account of persistence in sequential effortful tasks, showing that decisions to persist or disengage are jointly shaped by multiple factors related to fatigue, (lack of) progress, goal proximity, and early success.

13
Confidence Judgments Reflect the Standard Error of Noisy Evidence Samples Across Domains

West, R. K.; Sewell, D. K.; Scheibehenne, B.

2026-04-22 neuroscience 10.64898/2026.04.20.719573 medRxiv
Top 0.1%
1.6%
Show abstract

Confidence judgments play a critical role in guiding behavior by shaping information-seeking, learning, and decision strategies. These functions are most effective when confidence is well calibrated, that is, when subjective uncertainty aligns with the objective uncertainty in the presented evidence. Motivated by this, we investigated how people form confidence judgments from noisy samples of information, and whether they use statistically grounded strategies or rely on heuristics. Participants performed two categorization tasks, one with visual orientation stimuli and one with number stimuli. In each task, participants saw sequentially presented observations and made a decision about the generating category and simultaneously reported their confidence in that decision. We independently manipulated the number of observations and standard deviation of the sample to assess whether confidence reflected an integrated estimate of both sources of statistical uncertainty. Behaviorally, confidence and accuracy both increased with larger sample sizes and lower variability. Furthermore, confidence and accuracy were equivalent in samples matched for standard error, suggesting that participants relied on a statistically grounded strategy. Computational modeling further supported this interpretation: a model that scaled confidence according to the standard error of the sample mean provided the best fit to the data, outperforming more heuristic and Bayesian alternatives. This pattern generalized across the orientation and number tasks, suggesting a domain-general strategy for uncertainty estimation. Together, these findings demonstrate that people use structured, statistically grounded strategies to compute their confidence, supporting well-calibrated decision-making even in the absence of full Bayesian inference.

14
Asymmetric Reinforcement Learning Explains Human Choice Patterns in Decision-making Under Risk

Shahdoust, N.; Cowan, R. L.; Price, T. A.; Davis, T. S.; Liu, A.; Rabinovich, R.; Zarr, V.; Libowitz, M. R.; Shofty, B.; Rahimpour, S.; Borisyuk, A.; Smith, E. H.

2026-03-11 neuroscience 10.64898/2026.03.09.710615 medRxiv
Top 0.1%
1.5%
Show abstract

Human decisions under uncertainty are shaped by experience, but the computations that translate expectation and experience into choice remain debated in neural and cognitive science. Prior studies highlight reinforcement learning (RL) as a unifying framework, yet it is unclear whether human behavior under risk is better captured by symmetric updating from outcomes or by asymmetric learning that weights reward and loss differently. This work examines which learning strategies better explain trial-by-trial choices given contextual uncertainty and manipulations of outcome distributions. Our results show that a Risk Sensitive (RS) model with asymmetric learning rates best explains human behavior in our novel decision-making task. Fitting candidate models to individual trial histories yielded value signals that predicted both choice and response time. These results highlight that RS model, as an asymmetric learning provides a concise and identifiable account of behavior in decision-making under risk tasks.

15
Capturing learning on the fly: an eye-tracking method to quantify prediction errors and updating the prior

Hann, F.; Nagy, C. A.; Nagy, Z. O.; Nemeth, D.; Pesthy, O.

2026-03-11 neuroscience 10.64898/2026.03.09.710486 medRxiv
Top 0.1%
1.4%
Show abstract

The ability to build predictive models of the environment fundamentally drives adaptive behavior. Yet, the real-time dynamics of how these internal models are formed and updated remain poorly understood. Conventional methods often rely on indirect, offline measures or noisy motor responses, limiting insight into the fine-grained computational processes underlying learning. Here, we introduce a generalizable, gaze-based analytical framework that directly tracks the trial-by-trial dynamics of expectation formation and updating. Applying this framework to an unsupervised probabilistic learning task, we categorized anticipatory saccades to dissociate prediction errors arising from environmental stochasticity from those reflecting an inaccurate internal model, and quantified how these predictions were iteratively revised. Learners differentiated between these error types: noise-driven errors were more likely to happen, and triggered less updates than errors reflecting insufficient knowledge of the regularity. At the same time, participants exhibited a strong preference to repeat their previous predictions. This repetition bias was amplified when predictions aligned with the underlying regularity, but was also present for non-aligned responses. Critically, updating depended more strongly on whether a prior belief was consistent with the tasks probabilistic structure than on whether the predicted stimulus matched the actual, presented stimulus. These findings suggest that statistical learning may not strongly be driven by errors; rather, it may rely on conservative updating with relatively low learning rate, or, on a Hebbian, repetition-based process. Our framework thus offers a dual contribution: a broadly applicable tool for quantifying real-time expectations, and evidence for a learning strategy that prioritizes model stability in noisy environments.

16
From flexible to anticipatory processing: alpha and beta oscillatory signatures of feedback-guided strategy adaptation and memory updating

Al Safadi, M.; Chatburn, A.; Cross, Z.; Dawson, S.; bornkessel-schlesewsky, I.

2026-05-11 neuroscience 10.64898/2026.05.10.724182 medRxiv
Top 0.1%
1.2%
Show abstract

When humans learn under conditions of uncertainty, they dynamically adjust how they prepare for and respond to feedback. In navigating uncertain environments, the brain minimizes error by continuously refining internal models via memory updating (MU). Feedback is critical for MU, and anticipatory neural mechanisms shape how feedback is processed, likely reflecting learned environmental certainty. However, the literature has largely focused on post-feedback activity, leaving pre-feedback certainty-related mechanisms less understood. The present study aims to address this gap by examining how certainty modulates anticipatory states, preceding feedback and subsequent MU. We examined oscillatory activity prior to performance feedback in a reanalysis of EEG data previously published by Hassall and colleagues (2023). Twenty-one participants (16 female, Mage = 25.81 years) predicted the strength of cartoon characters with varying predictability levels which were learned through exposure. Feedback on prediction accuracy was presented via an animated rising bar. Results revealed that theta power is modulated by accumulative feedback. Linear mixed-effects models revealed an interaction between predictability-related certainty and learning stage: in late learning, higher performance was associated with increased pre-feedback alpha and beta power for low-certainty trials, whereas in early learning, higher performance was associated with decreased beta power. These learning-related modulations in alpha and beta power suggest that initial learning is marked by adaptable exploratory processing. Subsequent learning exhibited increased alpha-mediated inhibition and beta-related anticipatory activity for lower certainty trials, indicative of dynamic strategy refinement and selective engagement of task-relevant information. These results demonstrate that certainty shapes preparatory oscillatory activity associated with MU.

17
Multitasking boosts muscular endurance task performance due to elevated arousal level unattainable by the endurance task alone

Nagisa, S.; Oblak, E.; Shimojo, S.; Shibata, K.

2026-03-10 neuroscience 10.64898/2026.03.06.710139 medRxiv
Top 0.1%
1.2%
Show abstract

Multitasking is generally regarded as detrimental to performance. This deterioration effect is typically explained by the interference among tasks due to the limited capacity of information-processing resources, which in turn reduces the performance in each task. Contrary to this general view, we report evidence for a facilitation effect of multitasking on performance. This facilitation effect was observed in multitasking on a handgrip muscular endurance task and cognitive task, which are known to have little interference with each other. Specifically, we found that performance in the endurance task was facilitated with the difficulty of the concurrent cognitive task. This facilitation effect was mediated by additional pupil dilation due to the cognitive task. Increased effort with the difficulty of the cognitive task cannot explain the facilitated performance in the irrelevant endurance task. Instead, they suggest that the cognitive task elevated overall arousal to a level unattainable by the endurance task alone, which in turn facilitated performance in the irrelevant endurance task. To further test this arousal account, we manipulated participants motivation to the cognitive task by reward without changing its difficulty and found the same pattern of results. Thus, it is not effort or motivation specific to the cognitive task but rather overall arousal level that underlies the facilitation effect. These results unveiled a previously overlooked mechanism: a multitasking-induced arousal boost. Our findings suggest that multitasking can facilitate performance when the net effect of adding a concurrent task is governed less by the capacity limitation and more by the elevation of overall arousal.

18
The Two Lives of Visual Working Memory: Evidence for Distinct Conscious and Unconscious Representations.

Lipinska, A.; Ciupinska, K.; Rutiku, R.

2026-05-05 neuroscience 10.64898/2026.05.01.722131 medRxiv
Top 0.1%
1.1%
Show abstract

Visual working memory (vWM) is often linked to conscious experience and visual imagery, but it is typically described as a system that stores separate, independent items. These assumptions are difficult to reconcile, given the unified nature of conscious experience. Here, we test the hypothesis that vWM relies on at least two distinct representations: an underlying, unconscious memory trace and a consciously accessible, integrated representation. A total of 216 participants performed a change-detection task, in which they rated their perceptual awareness of the memory display during the maintenance interval. Critically, we manipulated the statistical properties of the displays (average item size and size variability) to probe sensitivity to unified ensemble-level structure. Results revealed a dissociation between subjective and objective measures. Perceptual awareness increased for displays with larger, more variable items, whereas objective performance improved for displays with smaller, less variable items. Despite this difference, subjective awareness still predicted performance, and even incorrect responses showed consistent biases rather than random guesses. Importantly, individual differences in imagery vividness (VVIQ) were selectively associated with subjective awareness and estimation bias, but not with objective correctness. These precision biases were further shaped by display statistics, suggesting that multiple representations can guide behavior. Together, our findings support a reinterpretation of vWM performance in which task responses can draw on both unconscious and consciously accessible representations. One possible explanation for these behavioral patterns is that subjective experience reflects integrated, ensemble-like representations, while objective performance depends more strongly on item-specific information. Public significance statementsWorking memory allows us to temporarily hold and use information, and differences in this ability are closely linked to broader cognitive skills such as intelligence. This study shows that these differences may not depend only on how much information people can store, but also on how they experience it: some individuals appear to rely more on consciously accessible, image-like representations, especially when memory is uncertain or prone to error. By demonstrating that subjective experience and the vividness of imagery can shape behavior independently of objective accuracy, these findings suggest that how we use memory may be as important as how much we can store, with implications for understanding individual differences in cognition.

19
Phasic dopamine drives conditioned responding beyond its role in learning

Hennig, J. A.; Burrell, M.; Uchida, N. A.; Gershman, S. J.

2026-03-25 neuroscience 10.64898/2026.03.25.714259 medRxiv
Top 0.1%
1.1%
Show abstract

Animals exposed to pairings of a neutral stimulus with reward acquire a conditioned response to the neutral stimulus. A prominent hypothesis, formalized in the Temporal Difference (TD) learning algorithm, is that animals learn to predict the future reward associated with the neutral stimulus ("value"). Though the TD algorithm does not explicitly specify what drives conditioned responding, a typical assumption is that it reflects the animals estimate of value. In TD learning, value estimates are updated using reward prediction error (RPE, the discrepancy between observed and predicted reward), and are thought to be signaled by the phasic activity of midbrain dopamine neurons. This hypothesis posits that dopamines effects on conditioned responding are mediated entirely by its effects on learning. However, recent experimental and theoretical evidence suggests that dopamine may play a more direct role in modulating conditioned responding. We use a combination of data analysis and computational modeling to probe the relationship between dopamine and conditioned responding. Our results suggest that dopamine directly modulates conditioned responding, in addition to its role in learning. These findings can be captured by a model in which dopamine RPE acts both indirectly (via learning) and directly on conditioned responding.

20
Investigating Effects of Outcome Controllability and Error Attribution on Proactive Attentional Control: Insights from EEG and Cognitive Modelling

Grote, L. A.; Schneider, D.; Wascher, E.; Arnau, S.

2026-03-05 neuroscience 10.64898/2026.03.03.709239 medRxiv
Top 0.2%
0.9%
Show abstract

Sense of agency (SoA), the experience of controlling ones actions and their consequences, is crucial for self-representation and adaptive goal-directed behavior. Classic comparator models explain SoA as the match between predicted and actual sensorimotor outcomes, whereas inference-based and Bayesian accounts emphasize cue integration and probabilistic weighting. Besides the influence of action-outcome contingencies on SoA, the feedback effect of perceived SoA on cognitive processing is also crucial for cognitive performance. Much of todays cognitive work is performed through interaction with devices that are not entirely reliable or are prone to operator error. Against this background, it is of particular interest whether the impact of an expectancy violation differs depending on whether the outcome is attributed to a malfunctioning system or to ones own mistake. To investigate this, the present EEG study deploys manipulated performance feedback in a color-discrimination task, while EEG was recorded. Thirty-five participants performed in this task with periods of veridical feedback, periods with feedback simulating an increased error rate, and periods of feedback suggesting malfunctioning response buttons. Behavioral performance was decomposed using the EZ-diffusion model, and time-frequency EEG analyses focused on event-related alpha, beta, and theta oscillations. The participants responded significantly slower in the self-attribution of errors condition compared to neutral feedback, and also significantly slower in the system-attribution of errors condition compared to self-attribution of errors. Decomposing behavior using drift-diffusion modeling indicates that a general decrease of response times with manipulated feedback can be attributed to decreased drift rates, whereas the difference between the self and system error conditions are reflected in the non-decision time. In the EEG, the manipulated feedback was reflected in attenuated decreases of occipital alpha and sensorimotor beta power during the cue-target interval. Furthermore, system-versus self-attributed errors elicited stronger feedback-locked midfrontal theta responses. Our findings suggest a functional dissociation within the agency inference process, where perceived controllability regulates preparatory investment of cognitive resources, while the attribution of action-outcome discrepancies seem to modulate sensory processes or motor-execution.